A semi-supervised cluster-and-label approach for utterance classification

Authors

  • Amparo Albalate
  • Aparna Suchindranath
  • David Suendermann-Oeft
  • Wolfgang Minker
Abstract

In this paper, we propose a semi-supervised cluster-and-label algorithm for utterance classification. The approach assumes that the underlying class distribution is roughly captured through fully unsupervised clustering. Then, a minimum number of labeled examples is used to automatically label the extracted clusters, so that the initial label set is "augmented" to the whole clustered data. The optimum cluster labeling is achieved by means of the Hungarian algorithm, traditionally used to solve optimization assignment problems. Finally, the augmented labeled set is applied to train an SVM classifier. We compare this semi-supervised approach to a fully supervised version in which the initial labeled sets are directly used to train the SVM model.
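The cluster-labeling step described in the abstract can be illustrated with a small sketch. This is not the authors' implementation. It assumes the clustering has already been done, that there are as many clusters as classes, and that the assignment is one-to-one. For brevity it finds the optimal assignment by exhaustive search over permutations rather than by the Hungarian algorithm; both return an optimal assignment, but exhaustive search is only practical for a handful of clusters (in practice, `scipy.optimize.linear_sum_assignment` solves the same problem in polynomial time). The function names, class names, and toy data are hypothetical.

```python
from collections import Counter
from itertools import permutations


def label_clusters(cluster_ids, seed_labels, classes):
    """Pick the one-to-one cluster-to-class assignment that agrees best with the seeds.

    cluster_ids: cluster index (0..k-1) for every utterance.
    seed_labels: dict {utterance index: class} for the few labeled examples.
    classes: list of k class names (assumption: as many clusters as classes).
    """
    k = len(classes)
    # overlap[(c, y)] = number of labeled seeds of class y that fell into cluster c
    overlap = Counter((cluster_ids[i], y) for i, y in seed_labels.items())

    best_score, best_perm = -1, None
    # Exhaustive search stands in for the Hungarian algorithm here (small k only).
    for perm in permutations(classes):
        score = sum(overlap[(c, perm[c])] for c in range(k))
        if score > best_score:
            best_score, best_perm = score, perm
    return dict(enumerate(best_perm))


def augment_labels(cluster_ids, cluster_to_class):
    """Propagate each cluster's class to every utterance in that cluster."""
    return [cluster_to_class[c] for c in cluster_ids]


# Toy data: 8 utterances already grouped into 3 clusters (hypothetical output
# of an unsupervised clustering step), with only 4 labeled seeds.
cluster_ids = [0, 0, 1, 1, 1, 2, 2, 0]
seed_labels = {0: "billing", 2: "internet", 3: "internet", 5: "tv"}
classes = ["billing", "internet", "tv"]

cluster_to_class = label_clusters(cluster_ids, seed_labels, classes)
augmented = augment_labels(cluster_ids, cluster_to_class)
```

The `augmented` list plays the role of the augmented labeled set, which would then be used to train the classifier (an SVM in the paper).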

Similar resources

Semi-Supervised Learning Based Prediction of Musculoskeletal Disorder Risk

This study explores a semi-supervised classification approach using random forest as a base classifier to classify the low-back disorder (LBD) risk associated with industrial jobs. The semi-supervised classification approach uses unlabeled data together with a small number of labeled data to create a better classifier. The results obtained by the proposed approach are compared with those o...

Graph Based Semi-supervised Learning in Computer Vision

Abstract of the dissertation "Graph Based Semi-Supervised Learning in Computer Vision" by Ning Huang (dissertation director: Joseph Wilder). Machine learning from previous examples or knowledge is a key element in many image processing and pattern recognition tasks, e.g. clustering, segmentation, stereo matching, optical flow, tracking and object recognition. Acquiring that knowledge frequently requires huma...

Label Propagation for Semi-Supervised Learning in Self-Organizing Maps

Semi-supervised learning aims at discovering spatial structures in high-dimensional input spaces when insufficient background information about clusters is available. A particularly interesting approach is based on propagation of class labels through proximity graphs. The Emergent Self-Organizing Map (ESOM) itself can be seen as such a proximity graph that is suitable for label propagation. It t...

Multi Label Spatial Semi Supervised Classification using Spatial Associative Rule Mining and Evolutionary Algorithms

This paper proposes multi-label spatial classification based on association rules with multi-objective genetic algorithms (MOGA), enriched by semi-supervised learning, to deal with the problem of multiple class labels. We adapt problem transformation for the multi-label classification. We use a hybrid evolutionary algorithm for the optimization in the generation of spatial assoc...

Semi-Supervised Classification Based on Classification from Positive and Unlabeled Data

Most of the semi-supervised classification methods developed so far use unlabeled data for regularization purposes under particular distributional assumptions such as the cluster assumption. In contrast, recently developed methods of classification from positive and unlabeled data (PU classification) use unlabeled data for risk evaluation, i.e., label information is directly extracted from unla...

Journal title:

Volume   Issue

Pages  -

Publication date: 2010